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Abstract Recently, considerable effort has been put into developing fast algo- 
rithms to reconstruct a rooted phylogenetic network that explains two rooted 
phylogenetic trees and has a minimum number of hybridization vertices. With 
the standard approach to tackle this problem being combinatorial, the rccon- 
striicted network is rarely unique. From a biological point of view, it is therefore 
of importance to not only compute one network, but all possible networks. In 
this paper, we make a first step towards approaching this goal by presenting 
the first algorithm — called ALlMAAFs — that calculates all maximum-acyclic- 
agreement forests for two rooted binary phylogenetic trees on the same set of 
taxa. 
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1 Introduction 

Over the last decade, significant progress in phylogenetic studies has been 
achieved by combining the expertise acquired in the fields of biology, com- 
puter science, and mathematics. As for the latter, combinatorics is becoming 
increasingly important in approaching many problems in the context of reticu- 
late evolution (e.g., see [Tni[TT| for two excellent reviews) which is an umbrella 
term for processes such as horizontal gene transfer, hybridization, and recom- 
bination. To analyze reticulation in evolution, the graph-theoretic concept of 
an agreement forest for two rooted phylogenetic trees has attracted much at- 
tention (e.g. [2l[7lfT4l[6l[T]). However, most approaches that make use of this 
concept aim at quantifying the amount of reticulation that is needed to simul- 
taneously explain a set of rooted phylogenetic trees. Thus, one is primarily 
interested in the number of horizontal gene transfer, hybridization, or recom- 
bination events that occurred during the evolution of a set of present-day 
species. Consequently, these approaches do not explicitly construct a rooted 
phylogenetic network that explains a set of phylogenetic trees. Nevertheless, 
this is desirable from a biological point of view because such a network in- 
tuitively indicates how species may have evolved by means of speciation and 
reticulation. While each vertex of a phylogenetic tree has exactly one direct 
ancestor, a vertex of a phylogenetic networks may have more than one such 
ancestor; thereby indicating that the genome of the underlying species is a 
combination of the genomes of distinct parental species. Generically, we refer 
to such a vertex as a reticulation vertex or, more specific in the context of 
hybridization, as a hybridization vertex. Since reticulation events are assumed 
to be significantly less frequent than speciation events, current research aims 
at constructing a rooted phylogenetic network that explains a set of rooted 
phylogenetic trees and whose number of reticulation vertices is minimized. 

For the purpose of the introduction, think of a so-called maximum-acyclic- 
agreement forest T for two rooted binary phylogenetic trees S and T as a 
small collection of vertex-disjoint rooted subtrees that are common to S and 
T (for details, see Section [2|. It is well-known that the size of T minus 1 
equates to the minimum number of hybridization events that are needed to 
explain S and T [2]. Furthermore, there exists an algorithm — called Hybrid- 
Phylogeny [3] — that glues together the elements of T by introducing new 
edges such that the resulting graph is a rooted phylogenetic network that ex- 
plains S and T and has \T\ - 1 hybridization vertices. However, until now, 
HybridPhylogeny, has not found its way into many practical applications 
that are concerned with reconstructing the evolutionary history for a set of 
species whose past is likely to include hybridization. This might be due to the 
fact that the reconstructed phylogenetic network is rarely unique because the 
gluing step can often be done in a number of different ways. Furthermore, given 
two rooted binary phylogenetic trees S and T, a maximum-acyclic-agreement 
forest for S and T is rarely unique. Given these hurdles, an appealing open 
problem is the reconstruction of all rooted phylogenetic networks that explain 
a pair of rooted phylogenetic trees and whose number of hybridization ver- 
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tices is minimized. Once having calculated the entire solution space of these 
networks, one can then for example apply statistical methods or additional 
biological knowledge to decide which of the phylogenetic network in this space 
is most likely to be the correct one. 

In this paper, we focus on a first step to reach this goal. In particular, 
we give the first non-naive algorithm — called ALlMAAFs — that is based on 
a bounded-search type idea and calculates all maximum-acyclic-agreement 
forests for two rooted binary phylogenetic trees S and T on the same set 
of taxa. With the underlying optimization problem being NP-hard [5] and 
fixed-parameter tractable |3] , the running time of ALlM A AFs is exponential. 
More precisely, we will see in Section [5] that the running time of of this al- 
gorithm is 0(3^^'' +p(n)), where n is the number of leaves in S and T, p{n) 
is some polynomial function that only depends on n, and k is the minimum 
number of hybridization events needed to explain 5* and T. 

The paper is organized as follows. The next section contains preliminar- 
ies and some well-known results from the phylogenetics literature. Section [3] 
describes the algorithm ALlMAAFs that calculates all maximum-acyclic- 
agreement forests for two rooted binary phylogenetic trees. Its pseudocode 
is also given in this section. Subsequently, in Section |4j we establish the cor- 
rectness of ALlMAAFs and give its running time in Section [5] We finish the 
paper with some concluding remarks in Section [6] 

2 Preliminaries 

In this section, we give some preliminary definitions that are used throughout 
this paper. Notation and terminology on phylogenetic trees and networks fol- 
low [12] and [S], respectively. 

Phylogenetic trees. A rooted phylogenetic X-tree T is a connected graph 
with no (undirected) cycle, no vertices of degree 2, except for the root which 
has degree at least 2, and such that each element of X labels a leaf of T. The 
set X represents a collection of present-day taxa and internal vertices represent 
putative speciation events. A rooted phylogenetic A'-tree T is said to be binary 
if its root vertex has degree two while all other interior vertices have degree 
three. We denote the edge set of T by E{T). The taxa set A" of T is called 
the label set of T and is frequently denoted by C{T). Furthermore, let w be a 
vertex of T. We denote by C{v) the label set of the rooted phylogenetic tree 
with root V that has been obtained from T by deleting the edge ending in v. 
Lastly, let be a set of rooted phylogenetic trees. Similarly to C{T), we use 
£(JF) to denote the union of leaf labels over all elements in T. 

We next introduce several types of subtrees that will play an important 
role in this paper. Let T be a rooted phylogenetic A'-tree, and let X' c X he 
a subset of X. We use T{X') to denote the minimal connected subgraph of T 
that contains all leaves that are labeled by elements of X' . Furthermore, the 
restriction of T to X' , denoted by T\x', is defined as the rooted phylogenetic 
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tree that has been obtained from T{X') by suppressing all non-root degree-2 
vertices. Lastly, we say that a subtree of T is pendant if it can be detached 
from T by deleting a single edge. 

Now, let T be a rooted binary phylogenetic A'-tree, and let X' be a sub- 
set of X . Then, the lowest common ancestor of X' in T is the vertex w in T 
with X' c C{v) such that there exists no vertex v' in T with X' £ C{v') and 
C{v') c C{v). We denote v by IcarCA"). 

Hybridization networks. Let X he a finite set of taxa. A rooted phyloge- 
netic network on A" is a rooted acyclic digraph with no vertex of both indegree 
and outdegree one and whose leaves are bijectively labeled by elements of 
X. Since this paper is concerned with hybridization as a representative of 
reticulation, we will often refer to a phylogenetic network as a hybridization 
network. Each internal vertex of a hybridization network with indegree 1 rep- 
resents a putative speciation event while each vertex with indegree of at least 
2 represents a hybridization event and, therefore, a species whose genome is 
a chimaera of its parents' genomes. Generically, we call a vertex of the latter 
type a hybridization vertex and each edge that enters a hybridization vertex a 
hybridization edge. 

To quantify the number of hybridization events, the hybridization number 
of N, denoted by h{N), is defined as 

HN)= Z iS-iv)-l) = \E\-\V\ + l, 

veV(N):S-(v)>0 

where V{N) denotes the vertex set of N and 6^{v) the indegree of v. Note 
that, if iV is a rooted phylogenetic tree, then h{N) = 0, and if S~{v) is at 
most 2 for each vertex v e V{N), then h{N) is equal to the total number of 
hybridization vertices of N. 

Now, let be a phylogenetic network on X, and let T be a rooted binary 
phylogenetic X'-tree with X' c X. Wc say that T is displayed by if T can be 
obtained from A^ by deleting a subset of its edges and any resulting degree-0 
vertices, and then contracting edges. Intuitively, if A^ displays T, then all of the 
ancestral relationships visualized by T are visualized by A^. In the remainder 
of this paper, we will consider the case where T is composed of two rooted 
binary phylogenetic trees. 

Extending the definition of the hybridization number to two rooted binary 
phylogenetic A'-trees S and T, we set 

h{S,T) = nim{h{N) : A^ is a hybridization network that displays 5* and T}. 

Calculating h{S,T) for two rooted binary phylogenetic A'-trees has been shown 
to be NP-hard 15j. 

Forests. Let T be a rooted binary phylogenetic A'-tree whose edge set is 
E(T). For the purpose of the upcoming definitions and, indeed, much of the 
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Fig. 1 (j) Two pliylogenetic trees S and T on A' = {a,b,c, d, e, f, g}. (ii) An agreement 
forest for S and T. (m) The graph AG(S,T,J^). Since this graph contains a directed 
cycle, is not an acyclic-agreement forest for S and T. 

paper, we regard the root of T as a vertex labeled p at the end of a pendant 
edge adjoined to the original root of T. For an example of two such trees, 
see Figure [iji). Furthermore, we view p as an element of the label set of T; 
thus £{T) = Xu{p}. Any collection of rooted binary phylogenetic trees whose 
union of label sets is C{T) is a forest on C{T). Furthermore, we say that a 
set T = {Fq, Fi, . . . , Ffc} of rooted binary phylogenetic trees, with \T\ referred 
to as the size of J^, is a forest for T \i T can be obtained from T by delet- 
ing a fc-sized subset E of E{T) and, subsequently, suppressing vertices with 
both indegree and outdegree 1. To ease reading, we write T = T - E \l T 
can be obtained in this way. Obviously, in the same way, we obtain a new 
forest T' = {Fl^,Fi...,Fl,} for T from T by deleting a fc'-sized subset E' of 
{JPiij^ E{Fi) and, again, suppressing vertices with both indegree and outde- 
gree 1. Similarly to the above, we write T' = J- - E' li T' can be obtained 
in this way. Now, let be a forest for a rooted binary phylogenetic A'-tree. 
We use T to denote the forest obtained from T by deleting all of its isolated 
vertices and, additionally, the element that contains the vertex labeled p if it 
contains at most one edge. Lastly, for two leaf vertices a and c with labels 
C{a) and £(c) respectively, we write a '^jr c\i there exists an element in T 
that contains a leaf labeled with £(a) and a distinct leaf labeled with £(c), 
otherwise, wc write a c. 

Let S and T be two rooted binary phylogenetic A'-trees. A set = 
{Fp, Fi,F2, . . . , F]^} of rooted phylogenetic trees is an agreement forest for S 
and T if is a forest for S and T, and p e C{Fp). Note that the before- 
hand given definition is equivalent to the definition of an agreement forest 
that is usually used in the literature and that we give next. An agreement 
forest T - {Fp, Fi, F2, . . . , F^} for S and F is a collection of trees such that 
the following properties are satisfied: 

(i) The label sets C{Fp),C{Fi),C{F2), . . . ,C{Fk) partition X u {p} and, in 
particular, p € C{Fp). 

(ii) For each i € {p, 1, 2, . . . , k}, we have F^ = S'|£(i?.) = T\^(^p.y 
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(iii) The phylogenetic trees in {S{£{Fi)) | i e {p, 1,2, . . .,k}} and {T{/:{F,)) \ 
i = {p,l,2, . . . , k}} are vertex-disjoint subtrees of S and T, respectively. 

Both definitions of an agreement forests for two rooted binary phylogenetic 
trees are used interchangeably throughout this paper. 

An agreement forest with the minimum cardinality among all agreement 
forests for S and T is called a maximum- agreement forest for S and T. An 
example of an agreement forest for the two trees S and T presented in Fig- 
ure [T|i), is shown in (ii) of the same figure. It is easy to check that this forest 
is in fact a maximum-agreement forest for S and T. 

A characterization of the hybridization number h{S,T) for two rooted 
binary phylogenetic trees S and T in terms of agreement forests requires an 
additional condition. Roughly, this condition avoids that species can inherit 
genetic material from their own offsprings. Let T = {Fp, Fi, F2, . . . , Fk} be 
an agreement forest for two rooted binary phylogenetic X-trees S and T. 
Furthermore, let AG{S,T,!F) be the directed graph whose vertex set is J- and 
for which {Fi,Fj) is an arc precisely if i + j, and either 

(1) the root of S{C{Fi)) is an ancestor of the root of S{C{Fj)) in 5*, or 

(2) the root of T{C{Fi}) is an ancestor of the root of T{C{Fj)) in T. 

We call T an acyclic-agreement forest for S and T if AG{S, T, T) does not con- 
tain any directed cycle. To illustrate, Figurejljiii) shows the graph AG{S^ T, !F) 
for 5, r, and T of the same figure. Note that T is not an acyclic-agreement 
forest for S and T. Similarly to the definition of a maximum-agreement forest, 
an acyclic- agreement forest for S and T whose number of components is mini- 
mized over all such forests is called a maximum- acyclic- agreement forest for S 
and T. The importance of the concept of acyclic-agreement forests lies in the 
following theorem that has been established in % Theorem 2] and gives an 
attractive characterization of the hybridization number for two rooted binary 
phylogenetic trees. 

Theorem 1 Let T = {Fp, Fi, F2, . . . , F^} be a maximum-acyclic-agreement 
forest for two rooted binary phylogenetic X -trees S and T . Then 

h{S,T) = k. 

In the proof of TheoremjlJ the authors implicitly show that, by deleting all hy- 
bridization edges of a hybridization network N that displays two rooted binary 
phylogenetic A'-trees S and T and has a minimum number of hybridization 
vertices and, subsequently, suppressing all non-root degree-2 vertices, one ob- 
tains a maximum-acyclic-agreement forest J- for S and T. Note that is well- 
defined for when N is given. We say that N yields T . On the other hand, given 
a maximum-acyclic-agreement forest T for two rooted binary phylogenetic 
rY-trees S and T, and using the algorithm HybridPhylogeny (for details, 
see [3]) to construct a hybridization network N from T that displays S and T 
and yields T , N is rarely unique. Nevertheless, if one aims at reconstructing all 
hybridization networks that display S and T and whose hybridization number 
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Fig. 2 Tlie two pliylogenetic trees obtained by calling CHERRyReduction(5, T, 0, {/, g}), 
where S and T are the two phylogenetic trees shown Figure 111 i). 



is minimized, one can first calculate all maximum-acyclic-agreement forests for 
S and T and then construct all possible minimum hybridization networks for 
each such forest. As mentioned in the introduction, this paper focuses on the 
first step of this approach, i.e. finding all maximum-acyclic-agreement forests 
for 5* and T. 

Now, let T he a set of rooted binary phylogenetic trees, and let a and c be 
two distinct leaves of J-. We say that a and c form a cherry in if they are ad- 
jacent to a common vertex, in which case we denote this cherry by {a, c}. Note 
that a and c refer to leaf vertices and not leaf labels. Let S and T be two rooted 
binary phylogenetic A'-tree, and let be a forest for T. Furthermore, let {a, c} 
be a cherry of S\^^-^y We say that {a, c} is a contradicting cherry of S and if 

there is no cherry {a', c'} in such that one of a' or c', say a', is labeled C{a) 
while c' is labeled C{c). Otherwise, we call {a, c} a common cherry of S and J^. 

Cherry reduction. Let be a forest for a rooted binary phylogenetic tree, 
and let {a, c} be a cherry of !F. The operation of deleting the two leaf vertices a 
and c and their respective labels and labeling the resulting new leaf vertex with 
£(a)u£(c) is called a cherry reduction. The new label C{a)uC{c) is sometimes 
referred to as a dummy taxon. We denote this reduction by T[{C{a) , C{c)} 
C{a) u>C(c)]. Reversely, we denote by JF[£(a) u >C(c) ->■ {>C(a), £(c)}] the 
operation of adjoining the vertex labeled C{a) u £(c) with two new vertices 
labeled C{a) and C{c), respectively, via two new edges and deleting the label 
C{a) u C{c). For an example of a cherry reduction, consider the two phyloge- 
netic trees S and T of Figurejlji) that have a common cherry {/,(;}. Reducing 
this cherry in S and T results in the two phylogenetic trees that are shown in 
Figure [2] 

We end this section with an important remark. 

Remark 1 The newly created leaf label, that results from applying a cherry 
reduction to a cherry {a, c} that is common to two rooted phylogenetic trees, 
is the union of the labels associated with the vertices a and c. For the rest 
of this paper, we therefore assume that the forest J- before applying a cherry 
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reduction and the forest T' that resuhs from applying such a reduction have 
the same label set although the number of leaves has been decreased by one; 
thus ll{T^ = C{T'). Furthermore, we write to denote the number of 

labeled vertices in T. Clearly, this number is always one greater than the 
number of leaves in T due to the vertex labeled p. Lastly, let S" be a rooted 
binary phylogenetic tree. We write 1{S) = 1{J^) if the number of labeled vertices 
in S and T is identical and if there is a bijection between the vertex labels of 
S and T . 

3 The algorithm ALlMAAFs 

In this section, we first give a brief outline of the algorithm ALlMAAFs that 
calculates all maximum-acyclic-agreement forests for two rooted binary phy- 
logenetic trees and, subsequently, present its pseudocode. Before doing so, we 
however start with an important remark to emphasize how the algorithm pre- 
sented in this section separates itself from previously published work, and give 
some additional definitions. 

Remark 2 While ALlMAAFs has a similar flavor as an algorithm presented 
in |14j that has been further improved in |13) . we remark here that our al- 
gorithm contains significant modifications due to a problem in both papers. 
In particular, Whidden et al.'s algorithms are based on a different definition 
of an acyclic- agreement forest T for two rooted binary phylogenetic A'-trees 
S and T compared to the definition that we have given in Section [2j Trans- 
lated into the language of this paper, they define T to be acyclic precisely if 
AG{S,T,!F) does not contain a directed cycle of length 2. Of course, this does 
not eliminate the possibility of cyclic inheritance in general although this is 
essentially required from a biological point of view. While ALlMAAFs con- 
siders this stronger constraint and calculates a maximum-acyclic-agreement 
forest as defined in Section [2] we additionally show that our algorithm also 
computes all such forests (see Section |4]). 

Let be a rooted binary phylogenetic X-tree, and let ^ be a forest such 
that 1{S) = 1{T). Let {a, c} be a cherry of S\j^fjpy We denote by Co the edge of 
T that is incident with the leaf vertex, say a', labeled /C(a), and by Cc the edge 
of T that is incident with the leaf vertex, say c', labeled C{c). Furthermore, 
if {a, c} is a contradicting cherry of S and T and a c, let Fi be the unique 
element of ^ such that C{a) c C{Fi) and £{c) c C{Fi). Let a' ,Vi,V2, . . . ,'y„,c' 
be the path of vertices from a' to c' in Fi . We define es = {u, w} to be an edge of 
Fi such that u e {vi, W2, ■ • ■ , v i {a', vi,V2, ■ ■ ■ ,Vmc'}, and u is an ancestor 
of V in Fi. An example of an edge es is shown in Figure [sji), where ei is such an 
edge for the contradicting cherry {a,b} of the two topmost phylogenetic trees 
of that figure. Now, an edge e of ^ is said to be associated with a contradicting 
or common cherry {a,c} for S and if one of the following holds: 
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1. e 6 {eajGc} if {a,c} is a common cherry of S and or {a,c} is a contra- 
dicting cherry of S and and a 'Pyr c, 

2. e 6 {ca, 63,62} if {a, c} is a contradicting cherry of S and T and a ~;fr c. 

We next describe the pseudocode of ALlMAAFs. The algorithm takes as 
input two rooted binary phylogenetic X-trees S and T, a rooted binary phy- 
logenetic tree R and a forest T such that 1{R) = and C{T) = C{T), 

an integer fc, and a hst M that contains information of previously reduced 
cherries. The output of ALlMAAFs is a set of forests for T and an in- 
teger k. We will see in Section |4j that if the input to ALlMAAFs are two 
rooted binary phylogenetic A'-trees S and T, R = S, = T, and M = 0, 
then ^ precisely contains all maximum-acyclic-agreement forests for S and 
T and their respective hybridization number if and only if fc > h(S,T). We 
will therefore assume for the rest of the description of the pseudocode that 
ALLMAAFs(S',r,i?,:P,fc,0) has initially been called for i? = S", J" = T, and 
M = 0. If A; < 0, the algorithm immediately stops and returns an empty set. If, 
on the other hand, fc > and 1{R) = 0, then a forest T' is obtained from JF by 
calling CHERRYExpansion(^, M); that is undoing all previously performed 
cherry reduction. As we will soon see in Lemma [2j JF' is an agreement forest 
for S and T. Thus, if the graph AG{S,T,!F') is acyclic, then T' is an acyclic- 
agreement-forest for S and T, and the algorithm returns !F' and \!F'\ - 1 with 
the latter being the hybridization number for S and T if ^ is of smallest size. 

Otherwise, if fc > and 1{R) > 0, the algorithm proceeds in a bounded- 
search type fashion by recursively deleting an edge in J-^ or reducing a common 
cherry by calling CHERRYReduction until the resulting forest is a forest for 
S and T. More precisely, each recursion starts by picking a cherry in R. Since 
1{R) > 0, a cherry, say {a, c}, always exists since, by definition of !F, we have 
1{R) > 2. Depending on whether {a,c} is a contradicting or common cherry 
of R and J^, and on whether or not a and c are vertices of the same com- 
ponent in J^, the algorithm branches into at most three computational paths 
by recursively calling ALlMAAFs. Note that the number of edge deletions 
that can additionally be performed at each step of the algorithm is given by 
the fifth parameter of each call to ALlMAAFs. In the following, we say that 
a computational path corresponds to deleting an edge in T if ALlMAAFs is 
recursively called for a forest, say JF', that has been obtained from deleting an 
edge, and R\ii(^y Similarly, we say that a computational path corresponds to 
calling CHERRyReduction if ALlMAAFs is recursively called for a tree and 
a forest that are returned from a call to cherryReduction. 

Now, regardless of whether {a, c} is a contradicting or common cherry of 
R and ALlMAAFs branches into two new computational paths that cor- 
respond to deleting and Cc in JF, respectively. Additionally, if {a,c} is a 
contradicting cherry of R and T and a c, then ALlMAAFs branches into a 
third computational path that corresponds to deleting an edge cb va. T . Simi- 
larly, if {a, c} is a common cherry of R and T, then ALLMAAFs branches into 
a third path that corresponds to caUing CHERRYREDUCTiON(i?, T, M, {a, c}). 
Intuitively, if {a, c} is a contradicting cherry of R and J^, then, to obtain an 
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agreement forest for the inputted trees S and T, one needs to delete at least 
one of Ca, Gc and cb- Otherwise, if {a,c} is a common cherry of R and 
then, to obtain an acyclic-agreement forest, say !F' for 5* and T, either the 
labels of a and c label vertices of the same component in J^' , which is mim- 
icked by calling CHERRyReduction for a and c, or the labels of a and c are 
contained in the label sets of two distinct elements in thus one needs to 
delete one of Ca or Cc- Noting that a common cherry of R and T is not neces- 
sarily a common cherry of S and T, we remark that this part of the algorithm 
has a similar flavor as [H Lemma 3.1.2], where the authors consider so-called 
common chains of S and T with at least 3 leaves. To keep track of the set 

of maximum-acyclic-agreement forests for the two initially inputted rooted 
binary phylogenetic A'-trees S and T, the algorithm uses the variable /cmin 
which is set to be the minimum number of edges that have so far been deleted 
in so that the resulting forest is an acyclic-agreement forest for R and !F, 
and updates and fcmin as appropriate throughout a run of ALlMAAFs. 

We end the description of the pseudocode by noting that ALlMAAFs al- 
ways terminates because, at each recursive call, either k is decreased by one 
or the number of leaves in R is decreased by one due to calling CHERRyRe- 

DUCTION. 



Algorithm 1: CHERRYREDUCTiON(i?, J", Af , {a, c}) 

Data: A rooted binary phylogenetic tree R and a forest such that 1{R) = a 
list M that contains all information of previously applied cherry reductions, 
and a common cherry {a, c} of _R and J^. 
Result: A rooted binary phylogenetic tree R' and a forest J^' obtained from R and 
J^, respectively by replacing {a, c} with a single leaf with a new label 
C{a) u C{c), and an updated list M' . 

1 M' ^ Add {C(a),C(c)} as last element of M; 

2 R' ^ R[{C{a),C(c)} C{a)uC{c)]; 

3 T' ^ y^[{C{a),C(c)} ^ C(a)uclc)]; 

4 return (R' ,T',M') 



Algorithm 2: CHERRyExpansion( J", M) 



Data: A forest T and a list M containing information of all previously applied 
cherry reductions. 

Result: A forest T whose vertices labeled with dummy taxa have been replaced by 
the corresponding cherries using the information contained in M . 

1 while M is not empty do 

2 A/ <- remove the last element, say {C{a),C{c)}, from M; 

3 \^ r^r[C{a)uC{c)^{C{a),C{c)}\, 

4 return T 
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Algorithm 3: ALlMAAFs(,S'. T. R. T. k. M) 



Data: Two rooted binary phylogenetic A'-trees S and T, a rooted binary 

phylogenetic tree R and a forest such that 1{R) = i(^) and C(T) = C{T), 
an integer k, and a list M that contains information of previously reduced 
cherries. 

Result: A set ^ of forests for T and an integer. In particular, if !F = T, R = S, 

M = 0, and k > h{S,T) is the input to ALlMAAFs, the output precisely 
consists of all maximum-acyclic-agreement forests for S and T and their 
respective hybridization number. 

1 if fc < then 

2 |_ return {0,k- 1); 

3 if mR)\ = then 



9 else 
10 
11 

12 
13 

14 

15 
16 

17 
18 
19 
20 
21 
22 
23 

24 
25 

26 
27 

28 
29 
30 
31 

32 
33 
34 

35 
36 

37 
38 
39 
40 



J^' ■>- cherryExpansion(.7=', M); 
if AG{S,T,J^') is acyclic then 
|_ return {J^', \J^'\ - 1); 

else 

1^ return (0,fe- 1); 

let {a, c} be a cherry of R; 

{:Fa, ka) *- allMAAFs(5, T, T-{ea}, k-1, M); 

if T'ai' then 
|_ fc' ^mm{k,ka + l-\T\); 

(Tc, kc) allMAAFs(5, T, fi|^( jr_{e^} ),-^-{ec}, k' - 1, M); 
\f :Fc + then 

|_ fc'^min(A;',fec + l-|.7^|); 

0; 

if {a, c} is a contradicting cherry of R and T then 
if a +jr c then 

if (fca + 1 - \r\ = femin) then :F = :F„; 

if ikc + l-\T\ = fcmin) then J= = J=uJ=c; 
return {T, fcmin - 1); 

else 



fca) ^ allMAAFs(S, T, R\^^^r:^^y 
if :Fb * then 

L fcmin ^ min(fc',fcs + 1 - 

if (fca + 1 - \r\ = fcmin) then :F = J^^: 
if {kB + 1 - |:F| = fcmin) then T = T^Tb; 
if (fcc + 1 - \T\ = fcmin) then :F = :F u :Fc; 
return (.^, femin - 1) 



^-{en}, fc'-l, M); 



else 



(R',T',M') *- cherryReduction(R, T, M, {a,c}); 
(Vr, kr) *- allMAAFs(5, T, R'l^^j^y T', k', M'); 
if :Fr*0 then 
L '=min min(fc', fe^ + 1 - \J^\); 

if (fca + 1 - \T\ = femin) then :F = y^a] 
if (fcc + 1 - = fcmin) then :F = :Fu:Fc; 
if (fcv + 1- = fcmin) then T = TuTr; 
return {:F, fcmin - 1); 



12 



Celine Scornavacca et al. 




a h c d e {f.g,' abed (f.g! e 




a b c d e (f.g,' a h {f,gj- e c d 



(ii) 




a h c d f g e 

(vii) 



Fig. 3 An example of a call to pr0CESSCherries(5, T,Ai,A2, . . . ,A6), where S and T are the 
phylogenetic trees of Figure[l[i) and the cherry list is (({/,9},0), {{o,tb},ei), ({a,b},0), 
({{a, 6}, c}, 62), ({c, d}, 63) ,({{/, g}, e}, 0)). In (i)-(vi), the phylogenetic trees and the 
forests are shown that are obtained by successively analyzing each cherry action of the 
above list while in (vi) the result of the call CHERRyExpansion(.F, M) is shown, where T is 
the forest that is depicted in (vi) and M contains all the information of previously applied 
cherry reductions. Note that the forest in (vii) is a maximum-acyclic-agreement forest for S 
and T. 
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Fig. 4 Two non-super-forests of tlie forest T in Figure [l|ii). The forest on the left-hand side 
is not a super-forest of T because there exists no subset T' of T such that C{T') = {a}. The 
forest on the right-hand side is not a super-forest of T because there exists no component 
Fi in T such that C{Fi) 3 {a,fe,c}. 



4 Correctness of the algorithm ALlMAAFs 

In this section, we prove the main result of this paper. In particular, we show 
that the algorithm ALlMAAFs calculates all maximum-acyclic-agreement 
forests for two rooted binary phylogenetic trees S and T for when inputted 
with R = S, T = T, M = 0, and k > h{S,T). We start with some additional 
definitions that are crucial for what follows. 

Let and G be two forests such that C{J^) = C{Q). We call Q a super-forest 
of T if and only if the following two conditions are satisfied: 

(1) for each Gj e Q, there exists a subset JF' of T such that C{T') = C{Gj), 
and 

(2) for each leaf vertex a in an element of G, there exists a component Fi in 
such that C{Fi) 3 C{a). 

For an example of two forests that are no super-forests of the forest that is 
shown in Figure [ijii) , see Figure |4] 

The next observation is an immediate consequence of the previous defini- 
tion. 

Observation 1 Given an acyclic- agreement forest T for two rooted binary 
phylogenetic X -trees S and T , then S and T are both super-forests for T . 

Let i? be a rooted binary phylogenetic tree and let be a forest such that 
1{K) = l{^)- Furthermore, let {a, c} be a cherry of R. In the following, we say 
that a pair a= ({a,c},e) is a cherry action if one of the following conditions 
is satisfied: 

(1) {a,c} is a contradicting or common cherry of R and T and e is an edge 
associated with {a,c}, or 

(2) {a, c} is a common cherry of R and T and e = 0. 

Finally, we say that A= (ai,A2, . . .A;) is a cherry list for R and T if and 
only if each Aj is a cherry action in iteration i of the following algorithm; i.e. 
processCherries does not return false: 

PROCESSCHERRIES(i?, J", (Ai,A2, . . . ,A;)) 
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M ^ 0; 

for each i = 1, . . . , ? 

({a,c},ei) ^A^; 

if {a, c} is a common cherry of R and T and = 

{R,T,M) ^ CHERRYREDUCTION(i?, ^, M, {a,c}); 
else if {a,c} is a common or contradicting cherry of R and and is associated with {a, c} 

^ ^ J" - {ej; 

else 

return (false); 
T cherryExpansion(:F, M); 
return {R,T ^M); 

Remark 3 The algorithm processCherries is mimicking a computational 
path of the algorithm ALlMAAFs for when the former algorithm is given a 
cherry list for R and T. A specific example of a call to processCherries 
is shown in Figure [3] with a detailed description given in the caption of this 
figure. 

In what follows, we will sometimes make use of the algorithm 
PROCESSCHERRiES(i?, JF, A), but without executing the call to CHERRyEx- 
PANSION in the second-to-last line of this algorithm. We refer to this slightly 
difi'erent algorithm as PROCESSCHERRlES*(i?, J", A) and to the returned for- 
est as a reduced forest. Now, let be the forest obtained from calling 
PROCESSCHERRiES(i?, JF, A), and let T" be the forest obtained from calling 
PROCESSCHERRiES*(i?, .F, A). We say that T' is the underlying forest for T" 
and observe that \T'\ = 

We continue with two important remarks. 

Remark 4 Applying PROCESSCHERRiES*(i?,.F, A), returns a tree R that does 
not contain any vertex if and only if, prior to calling CHERRYExpansion(.7-", M), 
the forest T only consists of isolated vertices and possibly an element that pre- 
cisely contains a vertex labeled p that is attached to a vertex by an edge; i.e. 
T = (for an example, see Figure [sjjvi)). 

Remark 5 By the definition of T, note that applying processCherries* 
never returns a tree R that consists of a single leaf attached to the root vertex 
labeled p. 

Now, let Q and T be two forests such that Q \s & super-forest of !F. Fur- 
thermore, let e be an edge and {a,c} be a cherry (if it exists) of Q. We say 
that e is a had choice for Q and T \i Q - {e} is not a super-forest of T . Note 
that Q - {e} always satisfies Condition (2) in the definition of a super-forest. 
Similarly, we say that {a,c} is a bad choice for Q and if the forest, say C/', 
that that is obtained from Q by reducing the cherry {a, c} to a new leaf is not a 
super-forest of T. Note that Q' always satisfies Condition (1) in the definition 
of a super- forest. 
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We next prove two lemmas that are necessary to establish the main result 
(Theorem [2| of this paper. 

Lemma 1 Let A be a cherry list for two rooted binary phylogenetic X -trees 
S and T , and let T be an maximum- acyclic- agreement forest for S and T . 
Additionally, let S' and Q' be the tree and the forest, respectively, that have been 
obtained from ca^/m;? PROCESSCherries *(S', T, A), and letQ be the underlying 
forest for Q' . If Q' is a super-forest for T , then S' contains at least one cherry 
or Q = T . 

Proof Suppose that this is not true. Thus, 1{S') = (see Remarkjs]) and there 
exists an element in T that is not an element in Q. Furthermore, since both 
forests are forests of T, we cannot have that there exists Fi^T and Gj 6 Q such 
that C{Fi) = C{Gj) and Fi ^ Gj. Since G' is a super-forest for ^ other than JF, 
there exist at least two components Fi and Fj of T such that C{Fi) u£{Fj) £ 
C{Gk): where Gk is an element of Q. Furthermore, since 1{S') = 0, we have 
G' = (see Remarkji]). Now, since Gk e G, either Gk is an isolated vertex a such 
that £{a) = C{Gk) 3 {C{Fi) u£{Fj)), or Gk is a leaf vertex that is attached to 
the vertex labeled p by an edge such that C(a)u{p} = C(Gk) 3 {C(Fi)uC{Fj)). 
Since neither C{Fi) = {p} nor C{Fj) = {p} (see [H Lemma 1]), G' does not 
fulfill Condition (2) in the definition of a super-forest; a contradiction. 

Lemma 2 Let S and T be two rooted binary phylogenetic X -trees, and let T 
be a forest that is returned from calling CHERRyExpansiON (line 4 of the pseu- 
docode of Algorithm^ while executing ALlM.AAFs{S,T,S, T,k,0). Then, T 
is an agreement forest for S and T . 

Proof Let T' be the forest for which calling CHERRyExpansion returns T. 
By construction, ^ is a forest for T. Now, for the purpose of deriving a con- 
tradiction, assume that is not a forest for S. Since ^ is a forest for T and 
C{S) = C{T), the label sets of the elements in partition £-{S). Thus, it is 
sufficient to consider the following two cases: 

Case (1). Assume that there exists an element Fi in T such that S\c{Fi) $ Fi. 
Since /(i?) = (line 3 of the pseudocode of Algorithm [s]) , we have by Re- 
mark W that J^' = 0. This implies that the element of with leaf sets C{Fi) 
has been shrunk to a single vertex or to a single leaf that is attached to the 
vertex labeled p by an edge. But, since S\c(Fi) t Fi, one of the cherry reduc- 
tions that has been used to shrink T|£(^.) is called for a cherry that is not a 
cherry of R, where R is the tree that is considered in some recursive call of 
ALlMAAFs(S', T, S, T, k, 0) (see pseudocode of Algorithm [s]); a contradiction. 

Case (2). Assume that there exist two elements Fi and Fj in such that 
S{C{Fi)) and S{C{Fj)) are not vertex-disjoint in S. Let S' = S\c(Fi)uC(Fj)- 
For example, the simplest case is shown in Figure [S] where the subtrees in 
white are part of Fi and the ones in black of Fj . In general, a straightforward 
check now shows that it is not possible to shrink both r|£(Fi) f^nd T\£(^p.^ to 
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Fig. 5 An example of a rooted phylogenetic tree S" that is used in Case (2) of the proof of 
Lemma [2] (for details, see text). 



two distinct single vertices in T' (one possibly being attached to the vertex 
labeled p) by using cherry reductions because to shrink one of the two compo- 
nents to a single vertex it is necessary to cut a subtree of the other component, 
thereby contradicting that Fi and Fj are both elements in T . Referring back 
to Figure [5| Fj cannot be shrunk to a single vertex by using a list of cherry 
reductions without cutting 6*1. 

Combining both cases establishes the lemma. 

Theorem 2 Let S and T he two rooted binary phylogenetic X -trees. Calling 

ALlMAAFsCS", T, S*, T, fc, 0) 

returns all maximum- acyclic- agreement forests for S and T if and only if k> 
hiS,T). 

Proof By Lemma |2] each forest that is calculated in the course of executing 
ALlMAAFs(S', T, S, T, k, 0) and checked for acyclicity (see hue 5 of the pseu- 
docode of Algorithmjs]) is an agreement forest for S and T. Thus, if fc > h{S, T) , 
each forest that is returned from running the algorithm is an acyclic-agreement 
forest for S and T. Moreover, since k is updated to take advantage of the size 
of the best solutions that previous recursive calls have found (lines 13, 16, 27 
and 36), only maximum-acyclic-agreement forests are returned. It is therefore 
sufficient to show that each maximum-acyclic-agreement forest for S and T is 
returned by the algorithm. 

Let ALlMAAFs(S',T, 5,T, fc,0) be a caU of Algorithm [sj and, for each 
I 6 {1, 2, . . . , h{S^ r) + l}, let Qi be the set of all reduced forests of size I that have 
been computed by executing this call. In other words, Qi precisely contains all 
forests that are used as a parameter in a recursive call to ALlMAAFs in lines 
11, 14, 25 and 34 of the pseudocode and, in particular, T e Qi. Furthermore, 
let !F he SL maximum-acyclic-agreement forest for S and T. We will prove that, 
for each I e {1,2, ... , h{S, T) + 1}, the set Qi contains a reduced forest Q' that 
is a super-forest for J-. This implies that Gh{s.T)+i contains a reduced forest Q' 
that is a super-forest of T such that = \G\, where G is the underlying forest 
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of Q' . Hence, as G and !F are both forests for T, it follows that Q is isomorphic 
to !F, thereby establishing the theorem. 

We proceed by induction on Z. If / = 1, then the result follows from Ob- 
servation [T] and because T eQi. Now suppose that the result holds whenever 
I < h{S, T) . We will next show that the claim holds for / + 1. Let Q' be a reduced 
forest Q' of size I such that Q' is a super-forest for T. By the induction as- 
sumption, Q' exists. Let Q be the underlying forest of Q' . Furthermore, let 
be the cherry hst that has been used by calhng ALlMAAFs(5', T, S, T, k, 0) to 
construct G' , and let R be the phylogenetic tree that is returned from calling 
PROCESSCherries*(5', r,A^). Since \g\ < it follows from Lemma [l] that 
R contains a cherry {a,c}. Let C{a) c X and C{c) c X he the label sets of 
the leaf vertices a and c, respectively, in i?, and let a' e C{a) and c' e C{c). 
Furthermore, if {a,c} is a contradicting cherry for R and Q' and a ~g' c, let 
C{B) c X be the union of labels of all leaf vertices that are contained in the 
pendant subtree below es in G' . Note that, since G' is a super-forest for !F, 
we have that there exist two elements Fi,Fj e T , not necessarily distinct, such 
that £ C{Fi) and £(c) £ ^{Fj). The rest of the proof distinguishes two 
cases depending on whether {a, c} is a contradicting or common cherry for R 
and G'- 

First, suppose that {a,c} is a contradicting cherry for R and G' ■ To derive 
a contradiction, assume that Gi+i does not contain any reduced forest that is a 
super-forest of T. In particular, this implies that deleting any edge associated 
with {a, c} is a bad choice for G' and since no resulting forests is a super- 
forests for although they all satisfy Condition (2) in the definition of a 
super-forest. Thus, one of the following holds: 

(1) a c and both edges {ca} and {ed are bad choices for G' and !F; 

(2) a ~g' c and all edges {cq}, {bb} and {ed are bad choices for G' and T . 

Case (1). Observe that neither G' - {e-a) nor G' - {ed is a super-forest of T . 
This implies that T does not contain an element Fi such that C{a) = C{Fi) or 
C{c) = C{Fi). Thus T contains two distinct components Fj and Fk such that 
C{a) c C{Fj) and C{c) c £{Fk), and for which there exist elements x,y € X 
such that a; e £{Fj), x i C{a), y e £{Fk), and y i >C(c). By construction, 
each of x and y is contained in a label of a distinct leaf in R. Now, recalling 
that {a,c} is a cherry of i?, we have that \c&ji{a',c',x,y) is an ancestor of 
lca/{(a',c') and, therefore, lcas(a', c', a;, ?/) is an ancestor of lcas(a',c'). Fur- 
thermore, as a',x € C{Fj) and c',?/ 6 £{Fk), it now follows that S{£{Fj)) and 
S{£-{Fk)) do both have the vertex lcas(a',c') in common; thereby contradict- 
ing that T is an agreement forest for S and T. 

Case (2). Observe that no forest in {G' - {ea},G' - {eB},G' - {sc} is a super- 
forests of JF. This implies that T does not contain any element Fi such that 
C{a) = C{F,) or C{c) = C{Fi) or a subset j'^' of J" such that C{B) = 
We next consider three subcases. 

First, assume that T contains a component Fj such that C{a) c C{Fj), 
C{c) c C{Fj) and there exists at least one element in the intersection C{B) n 
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C{Fj). Let b' be such an element. By construction, each of a', b', and c' is 
contained in a label of a distinct leaf in R. Now, recalling that {a, c} is a 
cherry of R, we have that lcaij(o', 6', c') is an ancestor of lca.f{{a' ,c') and, 
therefore, \cas{a' ,b' ,c') is an ancestor of lca5(a', c'). On the contrary, let Gk 
be the element of G' that contains the leaf labeled C{a) and the leaf labeled 
C(c). Since Gk also contains a leaf whose label contains b' and due to the 
definition of e^, we have that Ica^^ (a', 6', c') is an ancestor of Icagj. (a', &') 
or lcaGt,(c',6') and, therefore, lcaT(a', 6', c') is an ancestor of lcaT(a',6') or 
lcaT(c',6')- Thus, S\{a',b',c'} $ T\{a',b',c'}; thereby contradicting that is 
a maximum-acyclic-agrecmcnt forest for S and T. 

Second, assume that contains a component Fj such that £(a) c £{Fj), 
C{c) c C{Fj), and C(B) r\ C{Fj) = 0. Then, since there exists no subset !F' 
of T such that C{B) = C{T'), there exists a distinct element Fk e T such 
that b' € C{Fk) for any b' € C{B) and there exists an element x € X for which 
X e C(Fk) and a; ^ jC(B). Let Gfe be the element of 0' that contains the leaf 
labeled C{a). Clearly, Gk also contains the leaf labeled C{c) and the leaf whose 
label contains 6'. Furthermore, since 5 is a super-forest for ^, note that Gk 
contains a leaf whose label contains x. Furthermore, by the definition of cb, 
we have that the Ica^j^ (&', .x) lies on the path from the leaf labeled a' to the 
leaf labeled c' in Gk and, therefore, the lcaT(^'',a;) lies on the path from the 
leaf labeled C{a) to the leaf labeled £(c) in T. Now, it is easily checked that 
Fj and Fk arc not vertex-disjoint in T; thereby contradicting that is an 
agreement forest for S and T. 

Third, assume that !F contains two components Fj and Fk such that 
C{a) c C{Fj) and C{c) c C{Fk). Hence, there exist elements x,y e X such 
that X € C{Fj), X i C{a), y € £(Fk), and y i C{c). Note that x or y may or 
may not be elements of C{B). In this case, it is straightforward to see that we 
can derive the same contradiction as in Case (1). 

By combining Cases (1) and (2), we deduce that there exists a super- forest 

of !F that can be constructed from Q' by deleting one of {ea,ec} if a +g' c or 
one of {ea,eB,ec} if a c. Thus, this super-forest is an element of Gi+i- 

Second, suppose that {a,c} is a common cherry for R and G'. Again, to 
derive a contradiction, assume that Gi+i does not contain any reduced forest 
that is a super-forest of T. In particular, this implies that Ca, Cc, and {a,c} 
are all bad choices for G' and !F. Thus, similar to Case (1), !F contains two 
distinct components Fj and Fk such that £(a) c C{Fj) and £(c) c £{Fk), 
and for which there exist elements x,y & X such that x e C{Fj), x i jC{a), 
y 6 C{Fk), and y i C{c). Applying the same argument as in Case (1), this 
contradicts that the elements of are vertex-disjoint in S. Thus, one of Ca, 
Cc, or {a, c} is not a bad choice for G' and !F. If Ba or Sc is not a bad choice for 
G' and JF, then Gi+i clearly contains a forest that is a super- forest for T. On 
the other hand, if Ca and Cc are both bad choices for G' and then {a, c} is 
not such a choice. Hence, calling CHERRYREDUCTiON(ii, G\ M, {a, c}) returns 
a forest G" such that G" is a super-forest for J^. Note that the underlying 
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forest of Q" is Q. Since \Q\ < |JF|, it follows from Lemma [T] that after some 
additional recursions of ALlMAAFs, the algorithm chooses a cherry in line 10 
of the pseudocode of ALlMAAFs and subsequently deletes an edge in order 
to obtain a forest of size \Q\ + 1. Then by applying the arguments of Cases (1) 
and (2), and the argument of this paragraph (depending on the type of cherry 
the algorithm has chosen), it is easily checked that Qi+i contains a forest that 
is a super-forest of T. This completes the proof of the theorem. 

5 Running time of the algorithm 

In this section, we detail the running time of the algorithm ALlMAAFs. 

Theorem 3 Let S and T he two rooted binary phylogenetic X-trees, and let 
k he an integer. The running time of ALlMAAFs{S, T, S,T,k,0) is 0(3''^'). 

Proof Let be a forest for T that has been obtained from T by deleting n 
edges. Recall that ALlMAAFs stops when = (see Remark Uj). It is easy 
to see that, \X\ - n - 1 cherry reductions are needed to reduce to a forest, 
say !F', such that J^' = 0. Thus, the number of recursive calls is OdrYj). Since 
ALLMAAFs is called for at most 3 times from within each recursion, it now 
follows that the running time of ALlMAAFs{S, T, R,T,k, M) is 0(3l'^l) as 
claimed. 

While the worst-case running time that is presented in Theorem [3] is purely 
theoretical, it can be significantly optimized in the following way. Bordewich 
and Semple [4] showed that the problem of calculating the minimum number 
of hybridization events that is needed to simultaneously explain two rooted 
binary phylogenetic A'-trees S and T is fixed-parameter tractable. They used 
two reductions — called the subtree and chain reduction — to establish this re- 
sult. Loosely speaking, these reductions replace different types of features that 
are common to S and T with a small number of new leaves, thereby shrinking 
the original trees to their respective cores while preserving their hybridiza- 
tion number in a well-defined way. In fact, these two reductions are sufficient 
to yield a kernelization of the above-mentioned problem. More precisely, it is 
shown in ^ Lemma 3.3] that, by repeatedly applying the subtree and chain 
reductions to S and T until no further reduction is possible, the leaf set size 
of the so-obtained rooted binary phylogenetic trees is at most 14/i(S', T). It 
is now straightforward to see that modifying ALlMAAFs(S', T, i?, JF, fc, M) in 
the following way is sufficient to make use of this result. 

1. If i? = 5 and T = apply the subtree and chain reduction until no further 
reduction is possible and directly return (0, k-l) if the leaf set size of the 
obtained trees is greater than 14fc. 

2. Introduce a new global variable, say w, that is used to keep track of the 
weight of each initially reduced common chain of S and T (for details, 
see [4j). Additionally, whenever CHERRyExpansion is called for a for- 
est throughout a run of ALlMAAFs, also call SUBTREeExpansion and 
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CHAINExpansion to reverse each initially performed subtree and chain 
reduction, respectively. 
3. For each potential acyclic-agreement forest T' for S and T that is returned 
from calhng CHERRyExpansion, SUBTREeExpansion, and CHAInExpan- 
SION (see line 4 of the pseudocode of ALlMAAFs), do not only check if T' 
is acyclic, but also whether or not it is a so-called legitimate-agreement for- 
est (for details, see [4]). Note that this additional check can be performed 
in polynomial time. 

We denote this extended version by ALlMAAFs*(5, T, S,T,k,0). 

Now, noting that the subtree and chain reduction can be computed in 
0(n'^) for two rooted binary phylogenetic A'-trees, where n = \X\ [J], the next 
corollary is an immediate consequence of Theorem [3j and the kernelization 
ideas that are presented in [5 and briefly summarized prior to this paragraph. 

Corollary 1 Let S and T he two rooted binary phylogenetic X-trees, and let 
k he an integer. The running time of ALlMAAFs*{S, T, S,T,k,0) is 0(3"'' + 
n^), where n = \X\. 

6 Conclusions 

A topical question in current mathematical research on reticulate evolution 
is how to construct all rooted hybridization networks that display a pair of 
rooted binary phylogenetic trees such that the number of hybridization ver- 
tices is minimized. In this paper, we have made a first step towards achieving 
this goal by developing the first non-naive algorithm — called ALlMAAFs — 
that computes all maximum-acyclic-agreement forests for two rooted binary 
phylogenetic trees on the same taxa set. While this paper describes the theo- 
retical framework of ALlMAAFs and establishes the algorithm's correctness, 
a practical implementation is published in a forthcoming paper [1] and freely 
available as part of Dendroscope 0. Note that despite the worst-case run- 
ning time of ALlMAAFs, the algorithm seems to perform well in practice [T] 
for simulated and biological data sets. It is part of ongoing research to extend 
the algorithm HybridPhylogeny [3] in order to compute all possible hy- 
bridization networks that display a pair of rooted phylogenetic trees and whose 
number of hybridization vertices is minimized for when a maximum-acyclic- 
agreement forest for these two trees given. In combination with ALlMAAFs, 
such an algorithm will then compute all possible minimum hybridization net- 
works that display a pair of phylogenetic trees. 
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